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Methods of OBTAimfJG isoform s pecific: expression in mammauan cells 
FIELD OF INVENTION 

The present invention relates to the field of gene expression, in particular 10 inhit^iting tlie 
expression q1 a spe^fic isofonn of a gene in mammalian cells using douDle-strancifiQ 
RNA. The doLiMe^randed RNA based technology of the invention has wide 
appiicationSp such as for determming funchon of a particular gene isoform or developing 
therapeutic methods for treating diseases. 

&ACKGROUNP OF THE INVENTION 

Many eukaryotio genes are expressed as multiple isofarms through the oifTerential 
wtilization of transcnption and translation inibation sites or through alternative sphdng, 
thus giving nse to proteins of related sequence hut with Diocnemicatly as well as 
biotogically distinct features (Andreadis at ai., 1987. Annu. Rev. Cell Biol. 3, 207-242>. 
Although, in many cases, multiple isoTorms are expre$sed in the same ceil at the same 
time, the expression level and pattern of each isoform may vary with cell type and 
development! stage, rnajdng me study of each isoform confusing and dilRculL 

The conventional approach for investigating the function of Individual isofonms requires 
conditions wnere only one isoform is expressed or eJiminat^d to provide a clean 
DacKground (i.e. without the influence of other isofonns). Thus, typically the desired 
isoform has been expressed in cells or whole organisms in whici^ the target gene has 
been deleted, a process that is time consuming. Wnile several protocols for tissue- 
specific expression or elimination of a gene of interest have been developed (4ohnscn et 
al , 198d, MoL Ceil Btol. 9. 3293-3399; Tan, 1991, Dev. Biol. 146, 24-37; Qorman and 
BullocH, 2000, Curr. Qpin. Biotechnol. 11, 455-460), reports of isofbrm-spetific gene 
inactivation (l^^liaccio et al., 1999, Nature 402. 309-313} or expression in a ciaan 
bacKground are limited. Such protocols are usually lengthy, irfken requinng several 
months to establish the desired conditions. Dunng this time, cells or organisms may 
adapt to the new conditions (Muller, U.. 1999, Mech Dev. 82. 3-21 } and caution is 
required when interpreting the results. 
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in me last few years, advances in nudefp gcicj chemistry and gene transfer have Inspired 
new approaches to engmeer spe<^o inhit^ition of gene i^cpression. Anttsensi^ technology 
nas Deen the most cxsmmonly de^crioed approach in prcnocols to achieve fien^specific 
jnhit^ition, although more reoenay targ^ed gene silencing based on double^strandad 
RNA (dsRNA) inducing a response called post-transcripnonai gene silencing (PTQS) or 
RNA interference (RNAi) has been (^oposed. PT<SS is a phenomenon oi^inaHy reported 
in plants (van der KroJ et al., 1990, Plant Cell 2. 2gi-299; Napoli. et ai., 1990, Plant Cell 
2, 931-943), wher^ introduction of a iransgene causes silenong of the endogenous 
nomoiogous gene as wen as itself. The mechanism of PTGS involves enhanced mRNA 
degradation wrth double-stranded (ds}RNA as the trigger (Cogam ana Macino, 2000, 
Cunr. Opin, Genet- Dev. 10, 638-643; Canhew, 2001, Cum Opm. Cell BioL 13,244- 
248). A Similar phenomenon (qMeiiir^) was obsen/ed in Neurospora (Cogoni ei al., 1994, 
Antonie Van ueeuwennoeK 65, 205-209). The mechanism underlying RIMAi has heen 
partially elucidated and a 21- to 23-nt-long dsRNA was found to be the 
mtemiedia^/nnediator of mRNA decay (Zamore et al.. 2000, Cell lOi, 25-33; Bernstein 
et al., 2001, Nature 409. 363-366) 

In ttie animal Kingdonn, dsRNA-mediated gene silencing was first described m ttie 
nematode C. e^egans (Fire et a|., 1998. Nature 391, 808-811) and was termed RNA 
interference (RN^) Subsequently, RNAi nas been observed in a wide range of 
organisms including flies, trypanosomes, hydra, ze^rafish and mice oocytes (Cogoni and 
Maano. 2000; Bosner and Uit>ouesse, 2000, Nat. Cell Biol, 2. E31-36). Although RNAi 
ha^; been used extensively In non-mammalian systems, dsRNA inhibition has only 
recently been applied succe^afiilly in mammalian systems, in particular, an efficient 
posttranscriptfonal gane-silenang method ha$ been reported that employs a smait 
interfering doubie-stranded RNA, siRna, in mammalian oeiis (Sbastnr et ai.. 2001, 
Nature 41 1 , 494-498). The auttiors describe that transfection of 21 *nuO|eatide dsRNA 
(siRNA) can tngger PTGS of txsth the co-transfected and the endogenous gene in 
cultured mammalian oelts. 

using a cell-free system of dsRlsiA-mediaied mRNA decay, mRNA was shown to be 
cleaved in Orosopnila embryonal cell extracts withm the region of identity with the 
dsRNA (Zamore ex ai., 2000). However, tne potency of RNAi m worms and flies 
suggests that dsRNA acts cataiytically and/or is amplified (Zamore, 2001 , Nature 
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^ructurai Bioiogiy &. 746-750). Although it has been suggested that si RNAs are not 
replicated by endogenous RNA-dependent RNA polymerdse in C. elegans. data now 
exists supporting a model whereby siRNAs irvleed act as primers fbr ceilujar 
polymerases ^nd are extended to fbrvn ds RNAs in both C^iegans and Drosophila 
(Upardi et aU. 2001, Cell 107, 297-307; 81460 et al., 2001, Cell 107, 465-476; Nishikura. 
2001. Cell 107, 415-416). Propagation of RNAi to other regions of the mRNA in ihis way 
would result in the spread of gene silencing to related seqiienoes, thereby not allowing 
isoform^spetific dsRNA jnhibifion. 

There is a need in the art for a system to evaluate gene function, in particular for the 
evaluation of isoform function in mammalian cells, as wen as for therapies depenaam on 
dsRNA inhibition of closely r^ted mRNAs In mammaiian cells and this invention meets 
that need. 

SUMMARY OF THE INVENTION 

The present invention provides a method of expressing a desired isoform of a gene 
product 5n a cell absent undesired isofonns of the gene product, comprising the steps of: 
(a) exposing a mammalian cell to at least one nucleic acid that is at least a panially 
double-stranded ribonucleic acid, the double-stianded portion having at least 95% 
sequence identity to a common nucleic acid sequence shared by two or more isoforms 
of the gene product and (b) imnoduong an expression vector encoding a desired 
isoform of the gene product into the mammaUan cell, me desired isoform having a 
sequence compnaing one or more mismaichas reiativd to die double*etranded portion of 
the nucleic add, operably linKed to a pronvster capable of driving expression of the 
desired isoform in the cell. Preferably, the common nucleic acid sequence i$ at least 19 
consecutive nucleotides in length and is commcHi to all endogenous isoform? of the gene 
product in the ceil. 

The nucleic acid is preferably 19 to 25 nucieoildes long and the double-stranded portion 
of the nucleic acid is preferably 100% identical to the common nucleic acid sequence in 
pardcular, the at least pamalfy double-stranded nbonudeic acid contains a double- 
stranded portion of at least 19 nucleotides, preferably 19 nucleotides, and at least one 
two-nucleottde singie-siranded 3' overhang, preferably having two. 
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In preferred emt^odiments, the desired isoform is descntjed ijy a sequence compnsfng 
two or more mismatcri<^s relative to the dou&le-stranded portion of the nucleic ^cid. This, 
the expression vector encoding the desired isoform can have at (east one oodon (e«g., 
tuvo or more) that differs from the erviogenous sequence coding the desired tsofonm. 
Preferaoly, the desired isoform has an identical protein seQuence to Uie corresponding 
endogenous isoform. 

In one emimdiment, the desired isofomi replaces a mutant isoform esqsressed tay the 
cell, aUcwirtQ correct gene function. Thus, the mutant isoform can t^e oncogenic, 
apoptoptc, tumor Suppressive, inilammai^on inducive or suppressive, or angiogenic 

In one embodiment, the method of the invention is used to determine the function of the 
desired isoform {i.e, isoform of unRnown function) 

Tne method is useful In alltypes of mammalian ceus. irtciuding cancer calls, SuCh as 
HeLa (cervical cancer), PC3 (prostate cancer). MPA-MB-231 (breast cancer) and MCF- 
7, or cell m vhfo. 

The desired isoform can transcritied under the control of an endogenous promoter, a 
constiti4ive pmmoter. an induciMe promoier or a ttssue-speciHc promoter , tor example. 

The invention also provides metnods of assigning function to a desired isofomn of 
unMown function. Also provided are Kits and materials for carrying out the invention, as 
well as cells exhibiting isofonm^^pecific expression m a <dean bacKground- The invention 
also encompasses therapeutic uses and compositions t>ased on the methcMte of the 
invention. 



DETAILED DESCRIPTlOfS OF THE INVENTION 

In its broadest aspect, the present invention provides a method of expressing a desired 
isoform of a gene product \u a ceil absent undesired isoforms of a gene product, by 
exposing a mammalian ceil to at least one nucleic acid, the nucieic acid be^ng at least a 



10027013 23-Jan-02 06^1 



5 

partially dQubie-strandea rtbonucieic acid an«i the douWe-stranclecj portion having at 
least 95% sequence identity to a common nucleic aad sequence shared by two or more 
isoforms of the gene pitKlua; and Introducing an expresskan vector enoofling a desired 
isoform of the gene product into the mammalian cell, the desired isotorm having a 
sequence comprising one or more mtematches relative to the doMWe-strgnded portion of 
the nudeic acio, operahly tinKed to a promoter capable of driving expression of the 
desired isoform in said ceil. 

The nucleic acid will typically he a ntwnucieic acid (RNA) that in douDle-str^ed form 
has at least 96%, preferatily 98% most preferably 1 00% sequence identity to a common 
nudeiC acto sequence shared tsy two or more isoforms of the gene product. Pfeferred 
RNA molecules lof inhibition comprise sequences identical to a common nucleic acid 
sequence shared by two or mor^ fsoforms of the gene product over at least 15—25 
consecutive bases, preferably over at least 19 conseculive bases. Preferably the 
common nucleic acid seqMence is common to all endogenous isqfomne of the gene 
product expressed by the ceil of interest The RNA sequence is preferably chosen to 
have identity with exon sequences, in particular, the double-stranded portion of the 
nucleic add ts preferably 100% identical to the common nucleic acid sequence. Gene 
expression it* inhiWted m a sequence-specific manner in that nucleotide sequences 
corresponding to the dufHex region of the RNA are targeted for mhlbfton. 

Sequence identity (and specificity ^ he. to ensure that the nucleic acid is specific and 
does not cross-react with other unrelated gene sequences) may be optimized and 
common nucleic acid sequence detennined by sequence comparison and alignment 
algorfthms Known in the art (see Gribskov ana Devereux, Sequence Analysis Primer, 
StocKton Press, 1991. and references cSted therein) and calculating the peroent 
difference between the nucleotide sequences by. for example, Uie Smith-Waterman 
algorithm as implemented in the BESTFIT software program using default parameters 
(e-g.. University of Wisconsin Genetic Computing Group). 

The nucleic acid will typically be relatively short for use in mammalian cells to allow 
efficiem delivery of the nucleic add to the cell and also to avoid any PKR response of ttie 
ceil. Thus, ^e nucleic acios will ^picaliy be 19 to 25 nucleotides long, mors preferably 
the nucleic acid comprises a double-stranded portion of at least 19 nucleotides ana at 
least one two-nucleotide smgie-stranded 3' overhang, preferably two. Alternatively, other 



ways of drcmnverttina non-spedlic ceuular responses to dsRNA or more effiraent 
delivery methods can be u9^. 

The at least partially doubls-stranded RNA can t>e Ibrmed by a self-compiementaiy RW^ 
strand (sucn as a transcript having an inverted repeat), or Dy iwwo or more 
complementary RNA strands. RNA duplex formation may be initiated either inside or 
outside the cell. The RNA is introduced in an amount that allows delivery of at least one 
copy per ceil, preferaWy at least 5. iO, 100. 500 or 1000 copies per cell, depending on 
me application. The amognt jntroduceo is dependent on the dasirsd effect and can Pe 
easily determineo empidcaily. 

The nucieie add may comprise nuciaotides or linkages other than those tnat occur 
naturally in ribonucleic acid, for example, to stabilize the dsRNA from degfada^on, 
espedally when RNA is delivered to a cell and not produced by Itie cell Thus, one can 
employ oiigQfibonucieoudes or otigonutdeotides that comprise one or mom modified (i.e„ 
synthetic or nan-oatiira«y oc^ufring) nucteotides. Usually, nucleotide monomers in a 
nud^ acid are linked tsy ptiosphodiester Donds or analogues mereof. Analogues of 
phosphodiester Kntuagas include phosphgrothioate. pnosphorodithioate, 
pnosphoroseienoate, phospt^orodiseienoate, phospnonoaniiaihioate. phosphoranilidate, 
phosphoramidate. peptide, and the liKe linKages. Those of skill in the ait will recognize 
thatine reagents employed are commercially available or, in the case of the 
oligonucleotides, can be prepared using commerctelly avaUaWe instn«nentation. 
Pnefarably the duplet RNA wiH comprise ribonucleotide units or eahar nudeotide units 
tnat allow appropriate pracessins by the ceil and efficient mhibUlon of the isolonns 
expressed by the cell, 7?ie nudeic acid (typically RNAJ may be synthescteo either in vivo 
or in yitro, preferably in vftra In particular when Short sequences are employed- 
EndogenoMS RNA polymerase of the cell may mediate wanscnption in vivo, or ciopeq 
RNA polymerase can be used foriranscripTitm in vivo or in vitro, essentially as descnbed 
below. 

Thus, in one step of the rnethod of the invention a mammalian cell is exposed to m» 
nucleic acid described above to inhibit expression of specific isomers expressed in a cell 
of intei«si. preferably to inhibit all related isomers. The term -isomer" is *e|i Known m the 
art and is meant to encompass gene products that are produced as a result of 
differential gene splicing as well as from the use of alternative transcnption and 
translation start sites. In addition, for the purpose of the present invention, the tenn 
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isomers inciuoe any closely fBiqtB0 sequencses and therefore may jnclHde q mutate 
gene in a cell, smcsi as daieterioua point mutations and ine lite* The mutant iscrfomri can 
be oncogeriic. apop«>tic, tumor suppressive, inflammation inducive or suppressive, or 
angiogenic for wample, ana its detetanous luncbon corrected »>y the method of the 
inx/ention. Although the Examples below illustrate the invention using ShcA, any gene 
product pnsduoed m muituple fonns can Me targeted vising the methods of the present 
invention- For example, such proteins may K>e iherapeufically important proteins, su* as 
enzymes, e.g.. Kinases (PKjB), ras, integnns. e2F, Rb. F6F, other signaling molecules 
and transcription factors. 

The effect of asRNA on gene expression will xyptcalJy result in expression of the target 
isofanns Deing inhibited liy at least 1 0%, 33%. 50%, 90%» 95% or 99% when compared 
to a oeH not treated by this step. 

The mammalian cell can be any oei of interesL Thus, the c^l may be cells from the 
inner cell mass, extraembryonic ectoderm or embryonic stem cells, totipotent or 
pluripotent, dividing or non-dividing, parenchyma or epithelium. immortailizBd or 
transformed, or the liKe. The cell may be a stem cell or a differentiated celU CeU types 
tnat are differentiated indude without limitation adipocytes, fibroblasts, myocytes, 
cardiomyocyies, endothelium, dendritic cells, neurons, glia, mast cells, blood cells and 
leukocytes (e.g.. eryflinscytes, megaKaryotes, lymphocytes, such as B. T and natural 
kiuer cells, macrophages, neutrophils, eosinophils, basophils, platelets, granulocytesX 
epithelial cells. Keratinocytes, chondnacytes, osteoblasts, os^ociasts, tiepatocytes, and 
cells of the endocnna or exocrine glands, ad well as sensory oellsPrefarapty for the 
study of biological function of a particular isomer or for studying protein-protein 
interaciions, cells that are easily cultured are preferred. These may include cancer cells, 
for example, including Heua (cervical cancer), PC3 (prostate cancer), MDA-MB-231 
(breast cancer) and MCF-7 cells. It mil be apparent to one of ordinary skill in the art that 
the teachings of the specification can easily be applied to other aituations. such as 
carioer cells for ine replacement of a mutated Tsoform" with one or more desired 
isoform(s]. 

in a further step, an expression vector encoding a desired isoform of the gone product 
into the mammalian cell, the desired isoform having a sequence comprising one or more 
mismatches relative to the double-strandea portion of the nucieic aad, operably imneo to 
a promoter capable of dnving expression of the desired isoform in said cell. The desired 
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isofbnn preferaWy comprtses a sequence eomprising one or more mismatches, 
praferably two or more, retailve to me doubie^rand^j portion of me nucleic acid used to 
mniblt expression of the endogenous isofonns to ensure that the nuCieic acid (RNAI) 
does not inhibit expression of tne desired isoform. Preferably, the expression vector 
encodes a desired isoform using at least one oodon. more prefenaWy two c»dons, that 
differls) from tne endogenous sequence coding the corresponOing isoform. AMhoygh 
changes in proton sequence are encompassed by the present mvemton between the 
endogenous isoform and isoform under analysis, m particular those substitutions, 
deletion or addifions that oo not affect gene fundtion. preferably conservafive 
substitutions, most pieferapiy. the desired isofQim has an identical protein sequence to 
me corresponding endogenous isofoim (atthouQh the coding sequence will p& arffereni 
due to codon usage) 

For transcription of a transgene (whether the desired isoform or si/dsRNA) in vivo, an 
expression construct compriaing at feast one ragulatory region (e.g . pmmoter, 
ennanoer, silencer, splice donor and acceptor and pdyaaenylaiion signal) operably 
linked to the DNA cooing Ibr the desired RNA transcnpt(s) may be used to transcribe the 
RNA strand (or strands) The promoter can be of almost any origin. Prefen^ed are 
promoters that are active in the chosen host cells like the SV4D, bota-actin, 
metallothionein. T7, potyhedrin and cytomegalovirus promoters. The promoter can be a 
constitutive promoter, an induaoie promoter or a tissue^pedfic pronftoier, Ibr example, 
allowing inhibition to be targeted TO an organ or cell type; or transcription to be induced 
upon stimulation of an environmental condilion Ce.g-, infection, stress, temperature, 
chemical inducers); and/or engineering transcription at a developmental stage or age. 
Alternatively. A KnocR-in cor»siruct can be used to transcribe the nucleic add of interest 
under the control of an endogenous promoter, as is Known in the art. Modified or 
unmodified RNA can also be chemically or enzymaticaily synthesized by manual or 
automated reactions as described above. The RNA may be synthesized by a ceXMar 
RNA polymerase or a bacteriophage RNA polymerase (e.g-, T3. T7. SP6). 

Expression vecmrs may also include sequences allowing for their autonomous 
replication within me host csrganism. sequences that encode genetic traits that allow cells 
containing the vectors to be selected, and sequences that increase tne efficiency with 
wh,ch the RNA is transcnbea. Stable long-tenm vectors may be maintained as freely 
replicating entities by us.ng regulatory elements of viruses. Cell imes may also be 
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produced which nave integrated the vector into ihe genomic DMA and in this manner the 
tf;anscript(s} is/are produced on a continuous pasis. 

Thus, an expression vector can fuitner inaude, if desired, additianal sequences operably 
linKed to a promoter, such as a reporter gene (e.g., fluoresoent proteins, e.g., green 
fluorescent protein, p-galactosjdase, aikaJine phosphatase, luciferase, CAT, selective 
gene marKers that fecjiitate the seieaion of transfarmantfi due to the phenotypic 
expression of the rnart^r gene (e.g.. those expressing antibiotic resistance or, m the 
case ot auxotrophic host mutants, genes which complement nose lesions), or other 
useful sequences. 

Nucleic acids (whether expression vectors or double-stranded RNA}can he introduced 
into a ceil Dy various standard methods in genetic engineering, including physical 
methods, for example, simple diffusion, Oy injection of a solution containing the nucleic 
acid, bomiDardement by particles covered by me nucleic acid, soelcing the cell or 
organism in a solution of the nuctric atiia, or electropor^tion of ceM membranes in the 
pnasence of tne nucleic acid. A particularly preferred method for deiiven'ng nuOeic acids 
is the use of eleccroporation. AHemativeiy, a vlial constnjct accomplishes both efRdent 
introduction of an expression construct into a cell and transcription of RNA encoded by 
the expression construct Other metncqs Known in the art for intnxjucing nucleic acids to 
cells may also De used, ^s |s apparent to the artisan, such as iipid-mediated delivery 
systems, chemical mediated transport, sucn as calcium phosphate transfeciion, OEAE- 
dextran transfection, and me liKe. Tne transfec^ host cells can be cuRursd by standard 
methods in cell culture. 

The methods of tne present Invention are particuiany useful for determining the function 
of a particular i^oform. In tnis aspect of the invention, the method further comprises 
identifying a phenotype of the mammalian cell compared to when the desired isoform is 
absent, and assigning the phenotype to the desired isofonn. 

Tne invention also provides a KA comprising reagems useful in the methods of me 
inventioni which may include a nucidic add tMiing at least a panially double-strandea 
ribonucleic acid ana tne douDie-stranded portion having at least 95% sequence identity 
to a common nudeic acici sequence shared by iwo or more isoforms of the gene 
product: and an expression vector encoding a desired jsoform ot the gene product, tne 
desired isoform having a sequence comprising orie or more mismatches relative to tne 




10 



douDie-stranded portion of the nucleic acid, opeR0(y linked to a promoter capable of 
driving expression of the desired isoform in $aid cell. The pnefeired panicqlars of the 
components used in the method ar^ those descnoed heremahove. 

A mammalian csell exhiDiiing isoform-speofio expression achieved by ^ny of the methods 
of the inventton is also provided. 

As the isofomn may oe used to correct at>errant isoforms, also provided is a memoo for 
treating a disease comprising administenng to a subject m need of treatment an effective 
amount of a nMCieic acid Deing at least a panially double-stranceo ribonucleic acid and 
the douPle-stranded portion having at least 05%. sequence identity to a comoion nucleic 
acid sequence shared hy two or more isofomis (in particular the aberrant isoform) of ^ 
gene pmdua; and an expression vector encx«ting a desired isoform of the gene product. 
The desired isofomri having a sequence compfHsing one or more mismatches relative to 
the doubie-^trandea portion of the nucleic aciO, operahly liOKed to a promoter capable of 
driving expression of the desired isoform in said eel}. 

The invention Is further described, for the purposes of iHustration only, in the following 
examples. Methods of mcdecuiar gmetics, protein and peptide biochemistry and 
immunology referred to tKji not explicitly descnbed m this discloaure and examples ore 
reported m the scientific iicerature artd are well Known to those sKllled in the arc 

Examples 

Example 1 inhibition (knack-dcwn) ofShcA in HeLa celts 

The signaling adaptor/scarfold protein ShcA is a member of the She fenrtily, nfvhvch 
consists of thnse genes. 8hcA, Shc8/S1i/Sck and ShcC/N-Shc/Rai (Luzi et ai.. 2000, 
Curr. Opin. Genet. Dev. 10, 668-574). StiCA is ubiquitously expressed, whereas ShcB 
and SncC are e^ressed ^pecificaity in the brain (Nakamura et al-. 1d98, j. Biol. Chem. 
273, 6930-6967 J. There are three isoforms of ShcA, p66ShcA, p52ShcA and p46ShcA. 
derived from a single gene through differential usage of transcription initiation sites (p66 
versus p52/p46) and translation start sites (p52 versus p4BX which differ only in the 
amino terminal regions (Luau et a!-, 2QQ0). We cnose ShcA to exemplify the invention as 
OifferenT cellular functions hav^ tseen suggested for each isoform and therefore is of 
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scientiftc inteiest to obtain isoform-specific expression of ShcA. However, it will 
apparent to one of ordinary sMIl in the art that ihe inventK^n is not limited in ^ny way to 
this particular g^ne faiYiily 

The pnmary transcript of p52/p46 mRNA contains the entire sequence of p66 mRh4A; 
however, tne very 5' regjon of p56 mRNA Is present m tne first mtron of p52/p46 mRNA 
Out is aDsem in the latter mRNA, having tJeen spliced out. The p46 and p52 isoforms are 
derived from the same mRNA using different translation initiation sites. The pOe^siRNA 
site i$ in the 5' region only present in pBS ShcA mRNA, wWIe h/m-shc SiRNA site is in 
ihe second e»on which is present in both mRNAs Note that the sequence of peC-shc 
siRNA is ftom a 5' region or human p66Snw mRNA and is atasent in p52/4e ShcA 
mRNA. 

Cells were transfected us^ng the OfigofectAMlNE Reagent without siRNA or with h/nv 
she siRNA and |uc siRNA. Briefly, Hel^ cells were cultured m Dult^ecco's Moaified 
Eagle's Medium (DMEM. Giboo-BRL) supplemented with 10% (v/v) fetal catf serum 
(AMIMED), 0.2 mg/ml streptomycin and 50 units/mi penicillin at 37*C \n a humidified CO3 
(5%) inculiator. One day before transfection wiih siRNA, cells were plated in 6-weil 
plates in media without amttriotics at 1 Av^iQ^ cells per well 

Tne followtng 2lHTier oligQnhQnucleotide& p^'r$ were used as siRNA: h/m-shc siRNA 
fr^m nt 677-697 (in the PTB domain), S'^UA CUU GGu UCG GUA CAU GQQ-3' (SEQ 
ID NO:1 ) and ff-CAU QUA CCQ AAC CAA GUA GGA-y (SEQ IP N02), Sequences 
were derived from the sequence of human p66ShcA mRNA (accession number 
MSU7377) and its complement and eacS> pair has a 3* overhang of 2 nt on eatfi side. 
Designed RNA oligonucieofldes were blasted against Database (GEMBL) to ensure 
gene spedficliy. The RNA oligonucleotides were omameo from Microsynin (Balgach. 
Swteenand). Annealing was performed according to Elpashir et al. (2001 » NatMre 411, 
494^98). The complementary two strands (eacn at 20 |jiM) in 200 of annealing buffer 
(100 mWI pota$smm acetate, 30 mM mEPES-KOM pH 7.4, 2 mM magnesium acetate) 
were heated for 1 mm at 90*C and then Incubated for 1 n at 37"C. An siRNA 
corresponding to nucleotides 753-773 of the firefly luciferase mRNA (luc siRNA) was 
used as a negative control 
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e>RNAs were inireduoeei into HeLa cells ustnfl ifte Ouaofeo^MlNE fteagent tufe 
Tecnnologies) according » me manufeclurer-s instrucftons. v,lth 10 hI «f 20 siRNA 
and 3 til of transfectioo reagent per well. At different times after transfection. whole-cell 
e^aracts were prepared and analyzed by Westerr^ blotting. CeUs were lysed In a buffer 
oomairfng 120 mM Nad. 50 mM Tris pH 8.0. and 1% NP40 plus Ccmpieie (Roche) 
protein inh.&lWf tablets. The ^vhole-cell extracts (20 1*9) were anaiy^ by Western 
wotting for She A. p-hibalin and Grb2 using a polyclonal rabbit ani5-Shc anttoody (1250; 
Transouciion Uboratories), mouse monoclonal anti-efb2 (1 :1000: Transduction 
Uboratofies) or a moMse monoclonal anii-p tubuUn anfibody 11;10QO; SjQnna). We used 
anti-rabbit or anii-niouse HRP-finked antibodies from Amefsham as seoondaiy 
antibodies- An enhanced chemiluminescence detection method (ECU Amersham) wd* 
employed and the membrane was exposed to KodaK Xomal US film. Quantitieaiioo of 
ShcA pro^ns was done using ImageQuant 5.0. 

When HeUa ceils were transfected with the h/m-shc slRNA. the levels of ail three ShcA 
isoforms were strongly reduced 24 h after transfection and reached less than 20% of the 
control after 48 h and 4% after 60 h. The level of control protein (^-tubulin was not 
affected under the conditions employed, and this was also the case for Grb2, a protein 
that specifically interacts with ShcA upon activabon of growth-factor signaling (Uuzi et au, 
2000). Luc SiRNA had no effect Time-ooufse analysis showed that the levels of all SncA 
isofomTs remained low unfil the fifth day after iransfeoion but started to increase 
tnereafter- The reducfion in the three isoforms was uniform, auggesfiog that both 
p52/p48 and pBB ShcA mRNAs wer« equally targeted by the sIRNA, According to the 
invention, repeated transfection maintains or re^iablishes the low levels of ShcA 
isoforms. 



E3ample 2 tsoformspwii^ laiockfiQwn 

The selective inhibition of the ShcA p66 isomer is demonstrated m this example using 
SiRNA specific for pBS. Cells were left untreated (control) or transfected without siRNA 
(control) or w,m h/m-shc SiRNA, pBfrsnc s.RNA (from nt 235-256 in itie CH2 domam. S'- 
GAA UQA GUC UCU GUC AUC GUCS-. SEQ lO NO S; and 5'-CGA UC3A CAG aQA 
cue AUU CCS-3'. SEQ IP NO:4) and control luc siRMA essentially as described In 
Example i Whole-oeli extnacis were prepared 48 h later and analysed for ShcA and ^ 
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tubglin levels as m Exampld 1. When »Hs were tiansfected wRh p6S-shc siRNA. only 
the pSeshcA isofartn was refltjcecj with Kinetics similar w that obtained with h/m-shc 
siRNA; the other wo isofoms were ncji affected Slmilarty, p-tubulin levels were rtot 
affected. The mvel of p66ShcA m cejls challenged a second time with the p$S-shc sIRNA 
6 days after ihe initial trensfection, when the level of p66ShcA was veiy low but about to 
increase, and 10 days after the initial transfecrion, when the level of p66ShcA had 
recovered substantially, remained low and decreased markedly again ^fterwansfection 
at days 6 and 10, respecttvely. 




This example illustrates the specificity of the sIRNA effect with the effect of p66-5<* sj 
RNA being restricied to We p66ShcA isoibmii. Exfwewlon of pS2;p46 SncA was not 
aflected although p66 and p92/p46 mRNAs snare sequence Identity in most of the 
region 3* of ine siRNA site, indicating that the silencing signal ooes not propagate to 
regions of mRNA 3' to the siRNA in mammalian cells. 

In addition, the sequence of p66-shc siRNA is present in the primary transcripts of 
p52/p46 SncA mRNA but is spliced out of the mature mRNA. Although not wishing to be 
bound by theory, the facA that only the p6Q ShcA isofomi was downreguiated by pfiB-shc 
siRNA strongly suggests that the site of aaion of siRNA is confined to the cytoplasm ana 
to the spliced mRNA, |f siRNA acted in the nuc^us and triggered the decay of 
transcnpts contairiing the coiresponding sequence, the p52ShcA and p46ShcA proteins 
should have been equally downreguiated t>y me same pSS-shc siRNA, which was not 
the case. 



Brampfe 3 Isofaan-specWQ expr^lm afShcA 

This E;;ample illustrates a rapid, alternative approach for isoform-specrfic gene 
expression. The previous Ej«ampies snow how the adaptor protein ShcA can be 
suppressed in an isoform-spedfiic manner in a human cell fine. The siRNA wim a 
sequence shared by me two ShcA imnscrlpts suppresses pee. p52 and p46 <see 
Example 1 ). However, an siRNA whose sequence is present only in pBS mRNA 
suppresses only the p66 isoform (see Example 2j. suggesting that the siRNA signal 
does not propagate to me 3' region of the target mRNA m mis Example, the expression 
of individual isofomis is achieved by first dovimregulating all isofomits by the common 
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(Wm she) siRNA (a$ in Example 1 ) and tuen transfacting ««m an expression veoor for 
the cjBBire0 isoform hartJonng s»ent mutations at the site corresponding w tne> h/m-shc 
siBJvjA This allows functional analyisis of indiviqual ShcA isoforms or indeed any other 
gene encoding muWpie protein$. 

The fuii-Jength mouse p4«. p52 and pseShcA cDNAs were isolated ffonn NIH3T3 ceJIs Uy 
reverse transcriprase-pojymerase cham reaction using the sense primere ff-CQQ AAT 
TCA TGG GAC CTG GGG TTT CCT ACT-3' (S6Q ID NO:S), 5*-CGQ AAT TCATGA 
ACA AGC T6A <3TG GAG GCG-3' (SEQ ID ^0:6} and ff-CGG AAT TCA TGG ATC 
TTC TAC CCC CCA AGC CGA AGT A-3' {SEQ ID NO:7>. raspeciively. and the common 
anbsense pnmer 5'-CGQ AAT TCA CAC TTT CCG ATC CAC GGS TTC ca- (SEQ ID 
NO'S) FulWenflth ShcA cDNAs were mitiaily cionea Imo pBlMescriptll KS+ and 
nudeotjde sequences verified by the dideoxynudeotide chain termination procedure. 

An HA-taggea expression vector pcDNA3MA was constructed by inserting the 
overtapping cligonucieotide pair SvCCC ACC ATG GCT TAG CCA TAC GAT GTT CCA 
GAT TAC GCT G-3' (S£Q ID NO:9) and 5 -AAT TCA GCG AAT TCT GGA ACATCG 
TAT GGG TAA CCC ATG QTG GGG TAC-3" (SEO lO NO lO) imo the Kpni-EcoRl ste 
of pcDNA3 (Inviirogen). To construct expression vectors for riA-«ggad ShcA. p46HA. 
p52HA and pBBHA. the lulMengm cDNAs of p46. pS2 and p56 were mseriEd into the 
EcoBI-EcoRV site of pcPNASHA. SncA mutants in whicn potenfial internal inifiation 
methR3rMne codons were converted to leucine codons. thus expressing only the pCSShcA 
or 52ShcA forms, were created using the QuicKChange site-direrted mutagenesis kit 
(Stratagene). The overlapping oiigonudeotide pair 5'-CTC CTC CAG GAC CTG AAC 
AAQ CTG AGT G-3' (SEQ iP NO:l 1 ) and 5'-CAC TCA GCT T6T TCA GGT CCT GGA 
GGA G-3' {SEQ ID NO:l2) was used to mutate methionine 65 (start site for p52) to 
leucine in p66HA, resutung in p66HA-mi. Anotner overlapping oligonudeohdc pair. 5- 
CCA ACG ACA AAG TCC TGG GAC CCC GGG-S' (SEQ ID |S|0:l3)and S'-CCC CGG 
GTC CCA GGA CTT TGT CGT TGG-3' (SEQ iO NO:l4). was used to nfjutate the 
initiation sites for p46 in both p6eHA-m1 and p52HA. resultfttg in pfiBHA-MU and p52HA- 
MU. Silent mutations v^re introduced mto these vectors at the sites corresponding to 
h/rrv^hc SiRNA as above MSing the overlapping oiigonucieolida pair 5 -GGG GTT TCC 
TAC TTQ GTC CGCTAC ATG GGT TGT C-3' (SEQ ID N0.15} and 5"-CAC AAC CCA 
TGT AGC GGA CCA AGT AG6 AAACCC C-3' (SEQ ID (mutated nucleotides 
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Mnc|erlined)to give p46HA-sm. p52HA-MI.-«m and p66HArMU-sm. Note That proteins 
express^ from these vec^ am identical to the parent proteins. 

For isoform-spBafic expression, HeU cells were first transfecied with no siRNA (mock) 
or with h/m-shc siRNA to aowivegulate all three endogenous ShcA proteins (isoforms) 
essentially as described above. Two days mer. the ceils were iransfected with an empty 
expression vector. pCDNA3, or an expression vector for the desired isoform of wild-type 
mouse ShcA or silent mutant ShcA using Lipofeetamlne 2000 (Ulffe Technologiea) 
accotding to the manufecttirer's inawuctions. One day later, whole-cell extracts were 
prepared and analyzed Py Western Dlottmg for the ShcA expression level Membranes 
were plotted w»iih polyclonal anti-ShcA ana anti'p lupuim antttuxiies. 

As already oemonstrated in Example 1, h/m-shc siRNA KnocKed down all three isoftsmis 
of endogenous SncA almost completely. Transfection of mese cells mutant 
expression vectors for individual isoforms resulted in the expression of only the 
corresponding isoforms. Almost no protein was detected in cells transfected with wild- 
type expression vectors demonstrating that the siRNA acts on the exogenous ShcA 
expressed l>y the vectors. Cells which were not transfected with h/n»«hc siRNA 
expressed elevated levels of ShcA isofomns, irreapeeUve of the presence or atisence of 
mutations in the expres^on vectws. m contrast. siRNA did not act on the mutated 
sequences allowing isofomfi. specific expression of any She A isofomt- Ceiia irasfected 
with pBSHA-iWIU-STn dearly snowed specific expression of only p66, whereas p46HAr«m 
and p52 MA-MU-sm resmted in expression of only p46 and p52. respectively. 

The present inventors have snown that siim* can efficiently, spedficaiiy and rapidly 
downregulate the level of an endogenous protein in mammaBan cells. The effect of 
siRNA was restricted to mRNAs containing a sequence essentially identical to the siRNA 
useq. That the pnnrary action of siRNA on mRNA, whic*» is most iiKely en 
endonucieoiytic attach, does not propagate to other regions of the target mRNA was 
inferred from the fbnouwing obsew ations: n > the etfect of pe&«hc siRNA was restricted to 
p66ShcA {Example 2)and (2) n/m-SnoA siRNA targeted wi Wype ShcA mRNAs put not 
mutant mRNAs {Example 3). The second opservatton was with eclopically expressed 
ShcA mRNAs. 
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in this experiment, sequences of «lld-type ShcA mRNA and mutant ShcA mRNA for 
each isoform were Identical exceptor two nuciootWes at the siRNA recognifion site 
located in ttie middle of me mRNA. These two nucieottde cahgnes were sufficieiil to 
avoid the SiRNA effect on me sxpwssion of me desired isolbrm. If the silencing signal 
would spread either 5' or 3' of the siRNA. ShcA expression from both wild-^ and 
mutant mRNAs would na« t»een suppressed, as at least some of these sequences 
would De Identical. That expression fiom wHd^ype mRNAs but noilirom mutant mRNAs 
was suppressed strongly argues for smngent spe<Sficiiy of siRNA-mediaied mRNA 
decay and no propagation of RNAi in mammahan cells. This is fn stark contrast to results 
of experiments man insect cell-ffee system (Upardi et ai . 2001 . Cell 1P7, 297-307) and 
C. etegdos <Qii*o et al., 2QQ1 , Cell 107, 465^76) lhat have suQgested that lanQ dsRNA 
molecules synmesized by KNA-dependent RNA polymerase are intermediates In RNA. 
that amplify and maintain the etfea of slR^4A. implying 5-spreading of me sllencng signal 
from siRNA, If amplification is involved in silencing in mammalian calls, antisense RNA 
oligonucleotides alone should sen/e as a primer for RNA^iepenoent RNA polymerase 
and, mus. be suffiaentfor gene downiBguiatjon. We (bund that antisense RNA di4 not 
induce silenong. 

An important Impucailon of our results is that the comoinaiion of PTfiS usma siRNA and 
me isoform^pecific exprBssion of an homologous gene wim silent mutations causes the 
cell to express only one isofbrm. while Keeping me levels of omer isoforms very low. 
using mis KnocWovwHn memod, it snouw be possible to examine the effect of various 
mutations of individual isoforms on cellular activity The isoform sequences in the 
expression vectors should be designed so mat they are not recognized by the siRNA 
affecting expression of the cellular genes. Two mismatches in me mutated expression 
vector were sufficient for mem to be tree of siR^lA^nedlated suppression We used 
expression vectoi* for mouse ShcA in mis work because mere is a high degree of 
sequence Similarity Deiwean mouse and human ShcA {90% mRNA sequences and 93% 
amino acid sequences for pB6 isofbrm). Thus, instead of introduang mutations into 
expression vedofs. it would have been possible to des.gn a different siRNA wim a 
sequence matching perfectly me endogenous human ShcA gene but not the transfected 
mouse gene, and achieve similar results. The advantage of our approach, however, is 
mat we can obtain isoform-specific sncA expression in bom human and mouse ceiis 
using *e same set of probes 
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In summary, we have shown In mammalian oalls tnat m& site of action of siRNA- 
mediated mRNA degradation is confined to the cytoplasm and that the tanget mRNA is 
restricted to those mRNAs containing a sequence essentially identical to The siRNA 
used- These specific feaiures of sil^NA-mediaiea gene knocK-down can be employed 
over a short time period in coniMnction with specific expression vectors to establish 
conditions for expression of the ShcA gene m an isofonnrwspecific manner- This 
KnocKdown-in meihoa should he applicahle and useful for the study of any gene 
expressed as multiple isofomis. 
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Claims: 



1 , A method of expressing a ciesired isoform of a gene product m a cell B»»seni 
undesirecl isolbrms of a gene proquci. said methisd compnsing: 

{a) exposing a mammalian ceil to at least one nucleic acid, said nucleic 
acid neing at least a partially daut)le-stranaed nbonudeic acid and the 
dQMWe-«tranded portion having at least 95% sequence identity to a 
cotYwnon nudaic acid sequence shared by two or more isoforms of 
said gene product: and 
(b) mtroducing an expression ve«or encoding a desired isoform of sa<d 
gene product into said mammalian ceii, said desired isoform nav,nQ a 
sequence comprising one or more mismatches reiative to said douWe- 
snanded portion of said nucleic acid. operably linKed to a promoter 
capatMe of caving expression of said desired isoform in said cell 

2. The method of claim i, wherern said common nucieic acid sequence is at least i9 
consecutive nucleotides <n length. 

3. -me method of claim n or 2, wnerein saul common nuc|e,c add sequence is common 
to ail endogenous isoforms of said gene produa m said ceH. 

4. The method of any one of claims 1 to 3. whetern the douBie-stranded portion of said 
nucleic acid IS 100% identical lo said common nucleic add sequence, 

5. The method of any one of daims \ to 4. wherein saic nudeic add is ^9 to 25 
nudeotides long. 

6 The method of any one of daans i to 6. wnerem said at least paniaRy double- 
stranded ribonucleic acid comprises a doul>le-stranded portion of at least 19 
nudeotides and at least one two-nudeotide single-stranded 3- overhang. 

7 The method of any one of datms 1 to 6, wherein said des.red isofcrm comprises a 
sequence compnsmg t-yo or more mismatdies reiati,« to said douDie-stranded 
portion of sajd nucleic acid 
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8, The method of ^ of claims 1 to 7, wherein said expression vector encodes said 
desired isoferm using at least one codon that differfi from the endogenous sequence 
coding »id desfoeO isoform. 

9, The memod of claim 8, wherein said expression vector encodes said desired isoform 
ustnQ two codons that differ from tiie corresponding endogenous sequenoe coding 
said desired isoform- 

10, The method of claim 8 or 9. wherein said desired isoform has an identical protein 
sequence to the corresponding endogenous isoform. 

1 1 Tne m^mod of any one of claims 1 to 10, wnerein said desired isoform replaces a 
mutant Isoform in the cell. 

1 2. The method d[ claim 11 , wherein said mutant isoform is oncogenic apoptotic, tumor 
suppressiva, inflamn^tion indMCive or supF^sdive, or angiogenic. 

13. The method of any one of claims 1 to 12. further comprising determining me function 
of said desired isoform- 

14 Tne method of any one of claims i to 13, wherein said cell is a cancer cell. 

1 5- The m^hod of clatm 14, wherein said cell is selected from the group consisting of 
HeUa (cervical cancer). PC3 (prostate cancer), MDA-MB-231 (hPBasi canoer) and 
MCF-7. 

16 The method of any one of claims 1 to iS, wherein said desired isoform is uanscrfbed 
under the control of an endogenous promoier. 

17.T1ie method of any one of claims i lo 16, wherein said expression vector comprises 
a constitutive promoter operably linKed to said desired isoform. 

1 S. The method of any one of claims 1 toi6. wherein said expression vector comphses 
an inducible promoter operaWy linKed to said desneo isofonm. 
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19. Hfie metnod of any one of claims l tol6. wnerein said express'on vector comprises 
a tissue-spetific promoier cperaDly hnKed to said desired isoform. 

20. A kit compn^ing reagents expressing a desired isoform of a gem product in a cell 
absent undesired isoforms of a gene prodyct, wherein $afd hit comprises a nucleic 
acid toeing at {east a partially douDie-stranded rit>onuci&ic acid and the double- 
stranded portion Mavmg at least 95% sequence identity to a common nudeic acid 
sequence shared by two or more isoforms of said gene product; and an expression 
vector encoding a desired isoform of said gene product, said desired isofomi having 
a sequence comprising one or more rnismatches relative to said oouble-stranded 
ponion of said nucleic acid, operably linned to a promoter capaiiie of oriving 
expression of said desired isoforrn in said cell 

21 . A mammalian cell exhibiting isofQrm**specif ic expression achieved by any of the 
methods of claims 1-19. 

22. A method for treating a disease comprising administering to a subject in need of 
treatment an effiective amount of a nudeic acid being at least a partially douhle- 
stranded ribonucleic add and the double-stranded ponion having at least 95% 
sequence identity to a common nucleic add sequence shared hy two or mor& 
isoforms of said gene product; ar>d an expression vector encoding a desired isoform 
ot said gene product, said desired isoform having a sequence composing one or 
more mismatches relative to said doubie-stranoeo ponion of said nucleic acid, 
operably linKed to d promoter capable of dnving expression of said desired isoform in 
said ceil. 

23. A method of assigning function to a desired isofoon. said method compffsing: 

a) exposing a mammalian cell to at least one nucteic acid, said nucleic 
aod being at least a partla)ly aouble-stranded ribonucleic ac5d and the 
dowble-stranaed porttan having at least 95% sequence identity to a 
common nucielc aad sequence shared by two or more isoforms of said 
gene product; 

b) exposing said mammalian cell xa an expression veaor encoding a 
desired isoform ot said gene product, said desired isoform having a 
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sequence comprising one cr more mismatches relative tci said double- 
stranaed portion of s^ici nucleic ac^d. operably linked to a promoter 
capdPl& of dnvlng expression of said desirad Isoform in said cell; 

c) identifying a pnenotype of said mammalian cell compared to when said 
desired isoform is a]>sam. and 

d) assigning ?aid phenotype to said desinsd isofomft 
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ABSTRACT 

Methods of obtaining Isoform sneeific expres sion in mammaHan cells 



A method »s provided for isoform specific gene expression in a mammalian cell in me 
absence of other iscforms. Tne method uses RnAi to achieve the specific expressioa 
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